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During B cell development, long-distance DNA interactions are needed forV(D)J somatic 
rearrangement of the immunoglobulin (Ig) loci to produce functional Ig genes, and for 
class switch recombination (CSR) needed for antibody maturation. The tissue-specificity 
and developmental timing of these mechanisms is a subject of active investigation. A small 
number of factors are implicated in controlling Ig locus long-distance interactions including 
Pax5, YinYang 1 (YY1 ), EZH2, IKAROS, CTCF; cohesin, and condensin proteins. Here we will 
focus on the role of YY1 in controlling these mechanisms. YY1 is a multifunctional transcrip- 
tion factor involved in transcriptional activation and repression, X chromosome inactivation, 
Polycomb Group (PcG) protein DNA recruitment, and recruitment of proteins required for 
epigenetic modifications (acetylation, deacetylation, methylation, ubiquitination, sumoyla- 
tion, etc.). YY1 conditional knock-out indicated thatYYI is required for B cell development, 
at least in part, by controlling long-distance DNA interactions at the immunoglobulin heavy 
chain and IgK loci. Our recent data show thatYYI is also required for CSR. The mechanisms 
implicated inYYI control of long-distance DNA interactions include controlling non-coding 
antisense RNA transcripts, recruitment of PcG proteins to DNA, and interaction with com- 
plexes involved in long-distance DNA interactions including the cohesin and condensin 
complexes. Though common rearrangement mechanisms operate at all Ig loci, their distinct 
temporal activation along with the ubiquitous nature of YY1 poses challenges for determin- 
ing the specific mechanisms of YY1 function in these processes, and their regulation at 
the tissue-specific and B cell stage-specific level. The large numbers of post-translational 
modifications that control YY1 functions are possible candidates for regulation. 
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THE EARLY DAYS 

Yin Yang 1 ( YYl ) was first identified in 1 985 as a factor that yielded 
an in vivo B cell-specific DMS methylation interference pattern 
over the immunoglobulin heavy chain (IgH) intron enhancer (1, 
2). The enhancer site that bound YYl was defined as the p.El site 
(3) and nuclear factors that bound to this sequence were identified 
by EMSA (4). Our laboratory isolated a cDNA clone expressing a 
protein that bound to the IgK3' enhancer as well as the IgH (lEl 
site and named the protein NF-El (5). Simultaneously the factor 
was cloned by Tom Shenk's laboratory and named YYl (6) based 
on its ability to bind the adenoviral PI promoter and both activate 
and repress transcription, by Robert Perry's laboratory and named 
delta (7) due to its binding to the delta motif in the promoters 
of ribosomal protein genes, and by Keiko Ozato's laboratory and 
named UCRBP based on its ability to bind to the upstream con- 
trol region of retroviral LTRs (8). Ultimately, the name YYl was 
adopted by all. 

Yin Yang 1 contains four zinc fingers at its carboxyl termi- 
nus (amino acids 298-414) and a region rich in alanine and 
glycine between amino acids 154 and 201. The first 100 amino 
acids of YYl encode several notable features. Sequences 43-53 
contain 1 1 consecutive acidic residues while amino acids 70-80 
consist of 1 1 consecutive histidine residues. These two segments 
are separated by a region rich in glycine (residues 54-69). In addi- 
tion, sequences 16-29 have the potential to form an amphipathic 



negatively charged helix and sequences 80-100 are rich in proline 
and glutamine. Sequences near the carboxyl terminus (333-397), 
which overlap the YYl zinc fingers, and sequences 170-200 have 
been reported to be involved in transcriptional repression (6, 9- 
15). These sequences are known to physically interact with a vari- 
ety of transcriptionally important proteins including TBP, p300, 
c-myc, and HDAC2 (16). YYl sequences important for transcrip- 
tional activation reside near the amino-terminus (9, 12, 13, 17). 
Figure 1 shows various sequence features and functional domains 
of YYl. 

DIVERSE AND COMPLEX ROLES OF YYl 

Over the past 22 years, multiple diverse YYl functions have been 
identified. YYl is crucial for embryonic development because 
homozygous mutation of the yyl gene in mice results in peri- 
implantation lethality (18). YYl is implicated in lineage differen- 
tiation of skeletal and cardiac muscle, and in cell growth control 
(13, 17, 19-24), as well as disease pathways such as dystrophic 
muscle disease (25-27). YYl and its target genes are also believed 
to be central regulators of germinal center B cell development 
(28), and YYl has been suggested to regulate genomic targeting of 
activation induced cytidine deaminase (AID) (29). YYl is impli- 
cated in a number of cancers (30-32), and is overexpressed in 
B cell lymphomas that depend on AID function. YYl is associ- 
ated with B cell transformation and tumor progression in diffuse 
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FIGURE 1 I Diagram of YY1 domains and functions. Domains of YY1 are 
indicated with specific functions listed. The regions of similarity to 
Drosophila Plieohomeotic are indicated below/ the diagram. 



large B cell lymphoma (DLBCL) (33, 34), and high levels of YYl 
expression are associated with reduced patient survival in DLBCL 
as well as follicular lymphoma. CTCF-YYl elements are clus- 
tered in the imprinting domain of Tsix (35) and YYl docks Xist 
particles on the X chromosome via DNA and RNA interactions 
during X chromosome inactivation (36). YYl can also control 
imprinting at the Peg3 and Gnas domains (37). YYl can con- 
trol human immunodeficiency virus (HIV) gene expression and 
viral titers, and deletion of YYl binding sites in regulatory regions 
of human papilloma viruses correlates with increased viral gene 
expression and the development of cervical cancer (38-46). Thus, 
YYl function is related to transcriptional regulation, embryonic 
development, X-chromosome inactivation, imprinting, oncogen- 
esis, viral gene expression, epigenetic function, and a growing list 
of diseases. 

IDENTIFICATION OF THE PcG FUNCTION OF YYl 

A significant new function of YYl was suggested in 1998 when 
the Kassis laboratory cloned the Drosophila Pleiohomeotic (PHO) 
sequence and observed similarity to YYl (47) (Figure 1). Gir- 
ton and Jeon (48) demonstrated that PHO is a Polycomb Group 
(PcG) protein, a family of proteins involved in epigenetic chro- 
mosomal condensation, stable transcriptional repression, control 
of cell proliferation, hematopoietic development, as well as stem 
cell self-renewal. This raised the exciting possibility that YYl is a 
vertebrate PcG protein. PHO is highly homologous to YYl in two 
regions. These two regions include YYl sequences 296-414 and 
205-226 (the corresponding segments in PHO are residues 357- 
475 and 148-169, respectively). Sequences 298-414 constitute the 
four YYl zinc fingers. The homology over this region is extraordi- 
nary for organisms as diverse as flies and humans (112 identities 
out of 118; 95%). Within this segment, zinc fingers 2 and 3 are 
100% identical. The 205-226 segment is also highly homologous 
(18/22; 82% identity). Outside of these regions of high similarity, 
YYl and PHO showed no discernible similarity. PHO does not 
contain an obvious transcriptional activation domain and lacks 
YYl structural features such as acid and histidine stretches. How- 
ever, the two regions of high similarity between YYl and PHO, and 



their similar spatial locations within the proteins, suggested that 
they might carry out some of the same functions in vertebrates 
and flies, respectively. 

Prompted by the possibility that YY 1 functions as a PcG protein, 
we tested this hypothesis using a Drosophila in vivo transcription 
system, as well as a phenotypic correction assay. Our results showed 
that human YYl does indeed function as a PcG protein in vivo 
(49-31). We found that YYl can repress transcription in a PcG- 
dependent fashion, can phenotypicaUy correct pho mutant flies, 
and can recruit PcG proteins to specific DNA sequences result- 
ing in tri-methylation of H3 lysine 27 (49-51). The mechanisms 
responsible for targeting mammalian PcG proteins to specific 
DNA regions has long been proven enigmatic because none of the 
components of the PcG complexes bind to specific DNA sequences, 
yet the PcG complexes associate with specific DNA regions in vivo. 
Our demonstration that YYl is a mammalian PcG protein with 
high affinity sequence-specific DNA binding activity suggested 
that YYl is a crucial factor for targeting specific proteins to specific 
DNA sequences. The role of YYl in PcG targeting has been con- 
firmed in a number of studies (52-55) though clearly other factors 
are involved as YYl (and PHO) does not co-localize with PcG pro- 
teins in all cell types (56-58). A particularly exciting aspect of YYl 
PcG function is that PcG proteins are known to contribute to B cell 
development, and the PcG protein EZH2, like YYl, is required for 
Ig locus contraction (further explained below) (59). Nucleation of 
PcG proteins to specific target DNA sites by YYl within the Ig loci 
thus opens up a new avenue for mechanistic evaluation of B cell 
development and Ig locus contraction, because PcG proteins are 
capable of mediating long-distance DNA interactions (60). 

THE YYl REPO DOMAIN 

Using a fly transgenic approach, we set out to identify the YYl 
sequences involved in PcG function (61). We found that the 
region of 82% YYl -PHO identity (the 25 amino acids between 
residues 201 and 226), when fused to a heterologous GAL4 DNA 
binding domain, was necessary and sufficient for PcG-dependent 
transcriptional repression. Amazingly, this smsill 25 amino acid 
segment was also necessary and sufficient for recruitment of PcG 
proteins to DNA resulting in tri-methylation of H3 lysine 27. 
Therefore, we named YYl sequences 201-226 the REPO domain 
for their ability to REcruit Polycomb (61). A REPO domain YYl 
mutant ( A201-226) can mediate nearly all YYl functions such as 
DNA binding, transcriptional activation, transient transcriptional 
repression, and interaction with HDAC proteins. However, this 
mutant fails to carry out YY 1 PcG functions and fails to recruit PcG 
proteins to DNA (61). How the YYl REPO domain recruits PcG 
proteins to DNA is now being elucidated. Two homologous pro- 
teins, YAF2 and RYBP, were previously identified as YYl interacting 
proteins (62, 63). Functionally, RYBP associates with a subset of 
PcG complexes named PRC1L4 (64) and is involved in the repres- 
sive function of hoxD11.12, a mammalian "PRE-like" sequence 
(65). YAF2 was first identified by its ability to bind to YYl (63) 
and we found YAF2 can interact with the REPO domain perhaps 
functioning as a bridge protein in PcG recruitment (52, 66). The 
importance of the YYl REPO domain for B cell development is 
discussed below. 
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STRUCTURE OF IMMUNOGLOBULIN LOCI DURING B CELL 
DEVELOPMENT 

B cell development involves progression from Lin^Sca-l+c-kit+ 
(LSK) progenitor cells through a number of intermediate B cell 
stages including pro-B, pre-B, immature B, mature B, and plasma 
cell stages. The early stages of B cell development can be delin- 
eated by the rearrangement status of the immunoglobulin heavy 
and light chain genes. Both heavy and light chain genes are pro- 
duced during early, antigen-independent B cell development by a 
somatic rearrangement process that links together either V, D, and 
J segments (heavy chain), or V and J segments (light chain) to pro- 
duce functional Ig genes (67-70). The Ig loci are huge (2.4-3.2 Mb) 
and for rearrangement of distal variable region genes to occur, the 
loci must go through a physical contraction process. Prior to the 
onset of rearrangement, Ig loci reside at the nuclear periphery in an 
"extended" configuration. However, at the pro-B cell stage, when 
the heavy chain genes undergo rearrangement, the loci take up an 
intranuclear localization with concomitant contraction of the loci 
(heavy chain first followed by light chain) (71-74). While IgH DJ 



and proximal Vh to D and Vk to Jk rearrangements can occur 
without contraction, the distal V genes require locus contraction 
and looping for rearrangement (71-73, 75-77). 

Current data suggest that the Ig loci are organized as loops 
into rosette-like structures separated by spacer DNA (76, 78- 
80). A number of domains have been identified at the IgH locus, 
which adopt various conformations during development (76, 78- 
80). At the pre-pro-B cell stage, these rosette domains are in an 
extended conformation, but in pro-B cells the structure changes 
such that each V region domain is repositioned with all Vh regions 
approximately equidistant to the Dh and Jh regions, thus afford- 
ing roughly equal access for recombination (79, 80) (Figure 2, left 
panel). Similar structures are believed to exist at the Ig kappa locus 
at pro-B and pre-B cell stages (Figure 3). 

The mechanisms that control Ig locus contraction are 
unknown. A small number of transcription factors or protein com- 
plexes (YYl, Pax5, CTCF, IKAROS, cohesin, condensin, EZH2) are 
implicated in the DNA loops needed for V(D)I rearrangement (59, 
78, 8 1-86), but the molecular details and regulatory processes that 
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FIGURE 2 I Immunoglobulin heavy chain locus diagram with V, D, 
J, and C regions indicated and locations of known YYl binding 
sites, and the approximate positions of long-distance DNA loops 
that areYYI-dependent. The red circles represent the IgH intron and 



3'RR enhancers. The left panel models rosette-like loops 
encompassing the Vh regions at the pre-pro-B and pro-B cell stages. 
The right panel diagrams the Eii-3'RR long-distance DNA loop 
required for CSR. 



www.frontiersln.org 



February 2014 | Volume 5 | Article 45 | 3 



Atchison 



YY1 and DNA loops 



Known YYl Binding Sites 




Postulated YYl-dependent Loops 




Pre-B 



* — YYl Dependent Loops 



FIGURE 3 I IgK locus diagram showing the location of known YY1 binding sites, and the postulated YYl-dependent loops required forVic-Jic 
rearrangement. Locations of tine Iqk intron and kE3' enhancers and shown by red circles and arrows. Postulated locus structure in pro-B and pre-B cells is 
show below. 




control this mechanism are not clear. Pax5 binds to multiple repeat 
sequences in the distal region of the IgH locus (PAIR sequences) 
and is believed to participate in rearrangement of distal Vh genes 
(83). Non-coding antisense transcripts expressed across the PAIR 
sequences correlate with VDJ rearrangement and are postulated to 
be involved with IgH locus contraction (83, 87, 88). Pax5 controls 
some of these transcripts (83), and recently YYl was shown to 
regulate antisense transcripts across at least two PAIR sequences 
(87). Many Pax5 and YYl potential binding sites exist in the IgH 
locus (89) and these transcription factors co-localize at some of 
these sites (87). Similar to the Pax5 and YYl knock-out pheno- 
types (discussed below), PcG protein EZH2 knock-out results in 
arrest at the pro-B cell stage with impaired distal Vh to Dh-Jh 
rearrangement (59). CTCF and cohesin have been argued to reg- 
ulate Ig locus structure and to control interactions of Dh and Jh 
regions with proximal Vh segments and Jk regions with proximal 
Vk segments (81, 82, 90-92). Ikaros knock-out also impacts IgH 
rearrangement as well as locus contraction (93). 

THE ROLE OF YYl AND THE REPO DOMAIN IN B CELL 
DEVELOPMENT 

Yin Yang 1 has long been believed to play some role in 
immunoglobulin (Ig) gene regulation and B cell biology because 
it associates with multiple Ig enhancer elements including the 



heavy chain intron and 3' enhancers, the Ig kappa 3' enhancer, 
as well as to a site between the Ch yl and Y2b exons (1-5, 87, 
94) (Figures 2 and 3). The Shi laboratory at Harvard provided 
insight into the role of YYl in B cell development by demon- 
strating that conditional knock-out of YYl in the B cell lineage 
(using mbl-CRE which is expressed early after B lineage commit- 
ment) resulted in arrest at the pro-B cell stage (84). Pro-B cells 
lacking YYl have normal Dh-Jh recombination but reduced fre- 
quency of Vh-Dh-Jh recombination, with the defect being most 
severe for more distal Vh genes (84). These knock-out pro-B cells 
showed a defect in Ig locus contraction (84), and this phenotype 
has been confirmed by a number of studies (81, 88). Thus, con- 
ditional knock-out of YYl using mbl-CRE results in arrest at the 
pro-B cell stage, lost Ig locus contraction, and reduced rearrange- 
ment of distal V genes. Importantly, despite the fact that proximal 
VDJ recombination does occur, very few mature B cells are gen- 
erated in conditional knock-outs. Furthermore, introduction of a 
rearranged heavy chain gene only partially complements the YYl 
conditional knock-out phenotype, suggesting additional roles for 
YYl in early B cell development (84). 

Intrigued by the similarity between the YYl and PcG protein 
EZH2 B cell knock-out phenotypes (59, 84), we set out to deter- 
mine the importance of YYl PcG function for B cell development. 
Using YYl wUd-type and YYIAREPO retroviral constructs, we 
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transduced bone marrow from yyl^^f mbl-CRE mice and injected 
this transduced bone marrow into irradiated secondary recipi- 
ents. Thus, within the B cell lineage of the transplanted mice, only 
the transduced YYl constructs will provide YYl function due to 
deletion of the endogenous yyl gene by mbl-CRE action. While 
wild-type YYl largely restored B cell development, the YY 1 A REPO 
reconstituted cells arrested B cell development at the pro-B and 
pre-B cell stages (85). Interestingly, IgH VDJ rearrangement was 
largely normal, but IgK rearrangement showed a dramatically 
skewed repertoire. Only a small number of Vk genes underwent 
rearrangement with one third of rearrangements to the most distal 
5' V kappa gene. This dramatic result suggested that in the absence 
of YYl PcG function, most of the DNA loops at the Igic locus 
needed for IgK rearrangement were abrogated, and a small num- 
ber of loops that are independent of YYl PcG function remained 
for IgK Vk-Jk rearrangements. At least some of these loops may 
require E2A or Pax5 (85), although this is speculative. 

MECHANISMS OF Ig LOCUS CONTRACTION 

The dramatically skewed Vk-Jk rearrangement profiles in 
YYIAREPO compared to wild-type YYl mice (85), suggested a 
possible direct effect of YYl on IgK locus structure, and loss of IgH 
locus contraction in a YYl knock-out background suggested par- 
allel effects at the heavy chain locus. Consistent with a direct effect 
on IgK locus structure, RNAi knock-down of YYl in bone marrow 
cultures reduced IgK rearrangement at a subset of Vk genes (85). 
Since the Shi lab showed YYl is important for Ig locus contraction 
(81, 84, 88), we hypothesized that clusters of YYl binding sites 
exist across the Ig loci, and that YYl binding to these sites would 
result in recruitment of proteins needed for Ig locus contraction. 
As predicted, we identified clusters of YYl binding sites across 
the IgK locus that binds to YYl (85). We found that PcG protein 
EZH2 co-localized with YYl at these sites apparently as a result 
of recruitment by YYl (85). We also identified several proteins 
that physically interact with the YYl REPO domain providing 
potential insight into the mechanism of YYl function in locus 
contraction. Intriguingly, we found that proteins from the con- 
densin and cohesin complexes (SMC4 and SMCl) that are needed 
for contraction of chromosomes during mitosis (95-99), as well 
as lamin proteins, bind to the YYl REPO domain. Lamin proteins 
are known to be involved in long-distance DNA interactions ( 1 00- 

103) . Similarly, cohesin and condensin complexes, along with 
topoisomerase 2, are involved in mitotic chromosome contraction 
and higher order chromosome organization and dynamics (96, 

104) . During mitosis, condensin and cohesin proteins associate 
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FIGURE 4 I Summary of protein co-localization data across the Igic locus 

The IgK locus is shown in the top panel with Vk genes represented by vertica 
lines. The identified YYl binding sites are represented by black circles. 



with the chromosomes and function in chromosomal contraction, 
cohesion, assembly, and segregation (96-98). A subpopulation of 
these proteins remains chromosome-associated at specific foci in 
the interphase nucleus (98). Importantly, cohesin and condensin 
proteins are involved in numerous long-distance DNA interac- 
tions (92, 105-114). Therefore, we hypothesized condensin and 
cohesin proteins associate with Ig loci in pro-B and pre-B cells by 
virtue of interaction with YYl, and thereby function to partici- 
pate in Ig locus contraction. Consistent with this idea, we found 
that condensin proteins associate with the clusters of YYl binding 
sites that we identified within the IgK locus (85) in primary pro-B 
cells, but not in fibroblasts suggesting a B cell specific function of 
condensin and cohesin association with these sites (see Figure 4). 

To test the functional consequences of YYl and condensin bind- 
ing at the Ig kappa locus, we performed RNAi knock-down and Ig 
kappa rearrangement assays. We found that knock-down of YYl 
or condensin proteins resulted in reduced IgK rearrangement at a 
subset of Vk genes (85). Thus, YYl binds to sites in the Ig loci, per- 
haps recruits PcG, condensin, cohesin, and lamin proteins to these 
sites, and results in specific Ig locus chromosomal contraction. The 
identification of condensin mutants that specifically affect T cell 
development supports the idea of condensin proteins (which are 
ubiquitously expressed) having lymphoid specific functions (115). 
These complexes can mediate long-distance chromosomal inter- 
actions (105, 107), and kleisin-p, a member of the condensin II 
complex is important for T cell development as is cohesin subunit 
Rad21 (92, 115). Cohesin subunit Rad 21 (a kleisin family protein) 
is recruited to CTCF binding sites throughout the Ig loci during 
B lymphocyte development (82). As condensin I is involved in the 
process of physically compacting DNA in the presence of hydrolyz- 
able ATP (116), condensin complex proteins may also participate 
in bringing V genes in the Ig locus into close proximity with D and 
J gene segments. 

LONG-DISTANCE DNA INTERACTIONS AND CSR 

Long-distance DNA loops are also required for class switch recom- 
bination (CSR), which recombines the rearranged VDJ segments 
that provide antibody specificity with various Ig heavy chain con- 
stant (C) regions with different effector functions (117, 118). CSR 
requires a large 220 kb long-distance DNA loop synapse between 
the IgH intron enhancer (E(i) region, and the 3'RR enhancer 
downstream of the 3'-most Ca exon (119, 120) (the E|i-3'RR 
synapse; see Figure 2, right panel). In addition, CSR to individ- 
ual IgH C exons requires formation of inducible DNA loops from 
each switch region DNA sequence into the E|jL-3'RR synapse (119, 




Summary of ChIP data forYYl, EZH2, SMC4, SMC2, and BRRN1 are shown 
in the bottom panel. Positive ChIP signals are represented by a + symbol and 
question marks show inconclusive ChIP data. 
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120). Over 40 proteins are involved in the enzymology and mech- 
anism of CSR and include DNA repair (base excision repair and 
mismatch repair) proteins, DNA damage sensors, factors that alter 
chromatin structure, factors that bind to AID, and transcriptional 
regulatory proteins [reviewed in Ref (121)]. However, none of 
these factors are known to specifically impact the E|x-3'RR DNA 
loop required for CSR. 

Recent progress, however, has shed light on these long-distance 
DNA loops. CTCF and cohesin bind to the IgH 3'RR enhancer 
within the hs5-7 sites (81, 122, 123), and cohesin binding is 
induced at certain Ch switch regions in response to inducers of 
CSR implying a function for cohesin in CSR (123). Consistent 
with this, knock-down of cohesin subunits impairs CSR (123). 
In addition, knock-down of the cohesin loading protein NIPBL 
reduces CSR, reduces non-homologous end joining, and increases 
microhomology end joining (124). Interestingly, AID was shown 
to physically interact with condensin, cohesin, and INO80 com- 
plex proteins (123), precisely the same complexes that bind to YYl 
(85, 125, 126). 

Notably, we found that YYl conditional knock-out in splenic B 
cells significantly reduces CSR ( 1 2 7 ) . YY 1 physically interacts with 
AID, leading to stabilization and increased AID nuclear accumu- 
lation, and this control of AID nuclear levels can regulate CSR. 
Control of nuclear levels of AID is crucial not only for regu- 
lating antibody maturation processes (CSR and somatic hyper- 
mutation), but also is important for maintaining integrity of the 
mammalian genome. Elevated levels of YYl could cause aberrant 
accumulation of AID in germinal center B cells leading to increased 
mutagenesis andlymphomagenesis. Indeed, YYl levels are elevated 
in germinal center-derived human DLBCL (34), suggesting that 
YYl contributes to disease progression. However, we also found 
that YYl has a second function important for CSR. In collabo- 
ration with Ranjan Sen (NIA), we found that YYl is necessary 
for long-distance DNA loops formed between the Ep. and 3'RR 
enhancers (unpublished data). Recently, Kenter and colleagues 
identified a long-distance DNA loop between the E(i and hs3b- 
hs4 sites of the 3'RR that is dramatically induced upon induction 
of CSR in splenic B cells (119). We found that this long-distance 
DNA loop is YYl -dependent (unpublished data). Thus, YYl con- 
trols long-distance DNA loops in splenic B cells that are critical for 
CSR. Can the same be said of the long-distance DNA loops needed 
for IgH V(D) J rearrangement, and perhaps for other long-distance 
DNA loops? Recent evidence suggests this is the case. 

YY1-DEPENDENT IgH LONG-DISTANCE DNA INTERACTIONS 

The Sen Laboratory and colleagues indentified long-distance DNA 
loops in both the Vh distal and proximal regions, and at the 3' end 
of the locus (78). They found YYl bound to many of these seg- 
ments and postulated either homotypic YYl interactions to medi- 
ate these loops, or heterotypic interactions with other proteins 
(78). The essential nature of YYl for these loops was subsequently 
demonstrated. In pro-B cells, YYl conditional knock-out ablates 
long-distance DNA loops between the E\l region and the distal and 
proximal Vh regions (87). In addition, YYl knock-out in pro-B 
cells ablates loops between the E|x region and the 3'RR enhancer, 
hs5-7 region (87). Thus, YYl is essential for long-distance DNA 
loops within the IgH locus involved in either VDJ rearrangement. 



or CSR (Figure 2). Finally, YYl is also involved in long-distance 
DNA interactions at the Th2 cytokine locus and controls IL4, IL5, 
and IL13 expression (128). These dramatic results indicate that 
YYl is required for long-distance DNA loops that control IgH 
V(D)J rearrangement, CSR, and gene regulation. Our studies at 
the IgK locus (85) also indicate a role for YYl in long-distance 
DNA interactions needed for IgK rearrangement (Figure 3). 

REGULATORY MECHANISMS FOR YYl FUNCTION 

How might YYl be functioning in these diverse long-distance DNA 
interactions? As described above, in pro-B cells, YYl binds consti- 
tutively to the E|i enhancer, to hs5-7 sites in the 3'RR enhancer, 
to a site between the Cyl and Cy2b exons, and inducibly to the 
hs3b site in the 3'RR enhancer in splenic B cells (5, 78, 87, 94). The 
mechanism of regulation of developmental stage-specific function 
of YYl in VDJ rearrangement at the IgH locus (pro-B cells), in Vk- 
Jk rearrangement at the IgK locus (pre-B cells), and in CSR at the 
IgH locus (mature splenic B cells), is presently unknown. YYl 
may participate in regulatory stage-specific functions to control 
locus accessibility (129), but other factors may control accessibility 
enabling subsequent YYl DNA binding. 

Yin Yang 1 function can be regulated by a number of mecha- 
nisms. Stage-specific regulation could be at the level of YYl DNA 
binding, such as the LPS inducible binding in the 3'RR enhancer 
in splenic B cells. YYl binding to the Ig heavy chain 3'RR hyper- 
sensitive site 3b (hs3b) as well as to the E|x enhancer is inducible by 
LPS (94). In this case, YYl appears to be sequestered from DNA in 
resting B lymphocytes through interaction with hypophosphory- 
lated retinoblastoma protein (Rb). However, after LPS induction, 
Rb becomes hyperphosphorylated and releases YYl enabling it 
to bind to the hs3b and E[i enhancers. Interestingly, hs3b and 4 
hypersensitive sites are crucial for formation of E|x: 3'RRl enhancer 
synapses with germline switch region promoters after cytokine 
treatment (119, 120). We hypothesize that LPS induction of CSR 
might partially result from induction of YYl binding to the 3'RR 
and E(x enhancers leading to induced DNA loop formation. 

Alternatively, YYl may be controlled by stage-specific post- 
translational modifications, or by stage-specific interaction with 
other proteins. A number of YYl post-translational modifica- 
tions can regulate YYl DNA binding (phosphorylation of serines 
180 and 184, and threonines 348 and 378) (130-132), and YYl 
is sumoylated on lysine 288 (133), which can control protein- 
protein interactions. Phosphorylation of serines 180 and 184 is 
mediated by Aurora B kinase and expression of this kinase peaks 
in splenic germinal center B cells (www.immgen.org) when CSR 
is active. Several studies demonstrated that YYl subcellular local- 
ization is regulated during cell cycle progression and development 
(132, 134-137) suggesting that YYl might also regulate subcellular 
localization of interacting partner proteins. In addition, apoptotic 
stimuli promote rapid translocation of YYl from the cytoplasm 
to the nucleus in asynchronous HeLa cells (138). Thus, YYl might 
function to increase transport of proteins from the cytoplasm to 
the nucleus via the nuclear pore. 

During B cell development, YYl expression levels remain rela- 
tively constant, as defined by transcript levels (www.immgen.org). 
However, YYl protein levels are regulated in some systems yielding 
biological responses. This is most well studied in skeletal muscle 
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Condensin complex 



FIGURE 5 I Model of YY1 recruitment of proteins to DIMA needed for 
long-distance DNA interactions. Using tlie IgH locus as a model, YY1 
binding sites are indicated. Binding by YYl then results in recruitment of 
condensin, cohesin, and PcG complex proteins. These proteins may form 
homotypic or heterotypic interactions to mediate long-distance DNA 



interactions. Positions of the E|j. enhancer, 3'RR enhancer, and various V 
genes are shown by arrows. Black rectangles represent various Ch constant 
regions. YYl also physically interacts with AID, and AID is able to interact with 
condensin, and cohesin complexes, and thus may contribute to DNA loop 
formation in germinal center B cells. 



differentiation systems where YYl expression levels drop as a result 
of proteolysis (24), and in cardiac disease conditions (139, 140). 
Thus, regulation of YYl protein stability may control DNA loop 
formation. 

It should be noted that RNA expression profiles of PcG pro- 
teins EZH2 and YAF2, as well as cohesin, and condensin subunit 
proteins SMC4, SMC2, SMCl, SMC3, CAP-G, CAP-H (BRRNl), 
and CAP-D2 all peak during B cell development at the pre-B 
cell stage (www.immgen.org). Expression levels are also high in 
pro-B cells, but peak in pre-B cells, then drop in immature B 
cell stages. This expression pattern is coincident with the tim- 
ing of Ig rearrangement and is consistent with a role in Ig locus 
contraction and rearrangement. However, this timing is also coin- 
cident with high levels of proliferation in pre-B cells suggesting 
a possible effect of YYl on the pre-B proliferative burst dur- 
ing development. All factors peak again in germinal center B 
cells (www.immgen.org) suggesting possible roles in proliferation, 
CSR, or somatic hypermutation. 

Whatever the mode of locus accessibility or YYl DNA binding, 
YYl may then recruit proteins to DNA that are required for long- 
distance DNA interactions. As presented above, YYl physically 
interacts with PcG, condensin, cohesin, and lamin proteins, all 



involved in long-distance DNA interactions, and we have noted 
co-localization of some of these proteins with YYl at the IgK 
locus (Figure 4). PcG proteins can mediate long-distance DNA 
interactions (60), and since YYl recruits PcG proteins to DNA 
via the REPO domain (50, 61), we predict that this interaction 
will be important for long-distance interactions leading to DNA 
loop formation. Notably, condensin and cohesin complex pro- 
teins (105, 107), and lamin proteins (100-103) are all involved in 
long-distance DNA structures, suggesting that the DNA binding 
capacity of YYl at IgH and IgK sequences may nucleate protein- 
protein interactions that govern DNA looping mechanisms. In 
addition to co-localization of YYl and condensin proteins at the 
IgK locus, YYl co-localizes with cohesin at the hs5-7 sites in the 
3'RR enhancer (78, 81). 

MODELS OF YYl -MEDIATED LONG-DISTANCE DNA 
INTERACTIONS 

Based upon: (a) the crucial nature of the YYl REPO domain for 
B cell development, (b) the ability of this domain to recruit PcG 
proteins to DNA, (c) the physical interaction of the REPO domain 
with PcG, condensin, cohesin, and lamin proteins, (d) the co- 
localization of YYl, EZH2, and condensin proteins across the IgK 
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locus, (e) the co-localization of YYl and cohesin proteins at the IgH 
3'RR enhancer, (f) the effect of cohesin knock-down on CSR, (g) 
the effect of condensin subunit knock-down on Vk-Jk rearrange- 
ment, (h) the high levels of EZH2, YAF2, cohesin, and condensin 
proteins in pro-B, pre-B, and germinal center cells, (i) the critical 
role of YYl in long-distance DNA loops in the IgH V region and 3' 
region, and (j) the regulatory role of YYl in CSR, we propose the 
following mechanism. We propose that YYl binds to sites span- 
ning the IgH and IgK loci. Concomitant with YYl DNA binding, 
increased EZH2, YAF2, cohesin, and condensin subunit expression 
results in these proteins binding to the same DNA regions, presum- 
ably due to interactions with YYl. The nucleated PcG, cohesin, 
and condensin proteins then mediate long-distance interactions 
between the YYl binding sites resulting in contraction of the Ig 
loci in looped or rosette structures (Figure 5). These loops then 
control somatic rearrangement of IgH and IgK genes as well as 
CSR. Immediately upon maturation to the immature B cell stage, 
or upon maturation to plasma cells, EZH2, YAF2, cohesin, and con- 
densin protein expression drops dramatically (www.immgen.org), 
thus facilitating de-contraction of the Ig loci, perhaps assisting in 
regulation of the allelic exclusion process, and causing a decrease 
in the inducible loops needed for CSR. In the case of CSR, it is 
intriguing that AID binds to many of the same factors that bind 
to YYl (condensin, cohesin, and INO80 complexes) (123). Thus, 
YYl-AID physical interaction may also contribute to DNA loop 
formation (Figure 5). 

Finally, it has been proposed that YYl function in long- 
distance DNA interactions relates to the regulation of non-coding 
antisense transcripts in the IgH Vh PAIR sequences (88). YYl 
knock-out ablates some of these transcripts, and these transcripts 
have been proposed to play a role in IgH locus contraction (87, 
88). Some RNA transcripts are known to regulate long-distance 
DNA interactions via interactions with the mediator complex 
(141). Whether YYl functions in this mechanism is presently 
unclear. 

FUTURE STUDIES AND REMAINING QUESTIONS 

A number of outstanding questions remain. (1) Is recruitment 
to DNA of proteins involved in DNA loop formation dependent 
upon YYl DNA binding? (2) What mechanisms enable YYl to 
function at distinct loci at various developmental stages? (3) Is 
YYl function controlled by post-translational modifications? (4) 
Is YYl controlled by stage-specific protein interactions? (5) What 
functions and domains of YYl are needed for DNA looping, V(D)J 
rearrangement, and CSR? (6) What are the biochemical mech- 
anisms for Ig locus contraction and for DNA loop formation? 
These questions and others are important for immune function 
and control of gene expression. The ubiquitous nature of YYl and 
its involvement in looping at multiple loci (78, 87, 128) suggests 
that paradigms learned in the Ig systems will be globally applicable 
to other long-distance DNA interactions. 
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